On transfer inequalities in Diophantine Approximation 



by Michel Laurent 



Abstract — Let O be a point in R"'. We split the classical Khintchine's Transference 
Principle into n — 1 intermediate estimates which connect exponents cOd{Q) measuring 
the sharpness of the approximation to G by linear rational varieties of dimension (i, for 
< d < n — 1. We also review old and recent results related to these n exponents. 



1. Introduction and results. 

We revisit in this note some results of homogeneous transfer in Diophantine Approxi- 
mation. Let us first recall the classical Khintchine 's Transference Principle in its primary 
form [4, 12]. Let n be a positive integer and let O = (6*1, ... , 9n) be a point in R". We shall 
assume in all the forthcoming statements that the real numbers l,9i, . . . ,9n are linearly in- 
dependent over the field Q of rational numbers. Khintchine's Transference Principle relates 
the sharpness of the rational simultaneous approximation to 6i, . . . ,6^ with the measure 
of linear independence over Q of 1, 6'i, . . . , 6*^- Let us first quantify these notions. 

Definition 1. We denote respectively by ujo{Q) and a;j^_i(0) (the meaning of the indices 
and n — 1 wiU be explained afterwards) the supremum, possibly infinite, of the real 
numbers uj for which there exist infinitely many integer {n+l)-tuples [xq, . . . , Xn) satisfying 
respectively the inequation 



max |a;o6'j 

l<i<n 





— LD 


< ( max \xi\ 


1 or 


\0<i<n J 





Xq -|- X\Q\ -|- ■ ■ ■ -|- XnQn\ < ( ^aX \Xi\ 

0<i<n 



Now we can state Khintchine's Transference Principle as follows: 
Theorem 1. The inequalities 



- n + 1 



n 
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hold for any point in R" with l,9i, . . . ,dn hnearly independent over Q. Moreover, both 
inequaUties are optimal. 

It is convenient to view R" as a subset of P"^(R) via the usual embedding 
(xi, . . . , Xn) I— > (1, ici, • • • , Xn)- In the sequel, we shall identify with its image in P"(R). 
Let us introduce for each integer d with < d < n — 1, axi exponent u>diQ) which measures 
the approximation to the point e P"'(R) by rational linear projective subvarieties of 
dimension d, in term of their height. Denote by d the projective distance on P"'(R) (it 
will be defined in Section 2 below ; notice however that the use of any locally equivalent 
distance, such as the distance associated to the supremum norm on R", would lead to the 
same exponents). For any real linear subvariety L of P"(R), we denote by 

d(0,L) = mmd(0,P) 

the minimal distance between and the real points P of L. When L is rational over 
Q, we indicate moreover by H{L) its height, that is the Weil height of any system of 
Pliicker coordinates of L. The Weil height is normalized by using the Euclidean norm at 
the Archimedean place of Q. 

Definition 2. Let d be an integer with < d < n — 1. We denote byajdi^) the supremum 
of the real numbers oj for which there exist infinitely many rational linear subvarieties 
L C P"(R) such that 

dim(L) = d and d(0, L) < H{L)-^-'^. 
Definitions 1 and 2 are consistent, since d(0, L) compares respectively with 

\yo + yiOi-\ \-ynOn\ 



max 

l<i<n 



Xq 



and 



max % 

0<i<n 



when L is either the rational point (case d — 0) with homogeneous coordinates 
{1, xi/xq, . . . , Xn/xo), or the hyperplane (when d = n — 1) with homogeneous equation 

yoXo H h VnXn = 0. 

W. Schmidt was the first to investigate in [21] the properties of these exponents a;d(0). 
In fact, he did not introduce them explicitely ; however Theorems 9-18 of [21] provide us 
with various relations between more general families of exponents, which coincide with the 
exponents ujd{^) in our more limited framework. Note that in the setting of [21], the point 
0, which belongs here to P"(R) = P(R"^"'"^), has to be replaced by the associated line 
in R"^"""-^. The consideration of real linear subspaces of R""*"^ with arbitrary dimension, 
gives then rise to various angles and consequently to further exponents. See Section 4 of 
[2] for a projective formulation similar to the present one. Here is our main result which 
slightly improves on earlier inequalities due to W. Schmidt [21]. 
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Theorem 2. For any point Q with l,6i, . . . ,6n linearly independent over Q, and for any 
integer d with 1 < d < n — 1, we have the estimate 



Following the terminology of [21], we split the above estimate into two parts ; namely 
the Going-up transfer 

(2) u!d+i{B)> — , 0<d<n-2, 

n — d — 1 

and the Going-down transfer 

The Going-up transfer (2) refines Theorem 11 of [21] in our restricted situation. It will be 
established in Section 3. The Going-down transfer (3) is a very special case of Theorem 
10 from [21]. We shall not prove it here again, although a similar argumentation using 
the inner product instead of the wedge product, could be employed. Note that exterior 
and inner products are algebraic translations of the geometrical operations of join and 
intersection for sub varieties in a projective space. 

An interesting feature of Theorem 2 is that it covers Khintchine's Transference 
Principle. Iterating respectively (2) and (3), we easily find by induction on the integer 
k the lower bounds 

(4) iOd+k{&)> + 0<d<n-2,l<k<n-d-l, 

n — d — k 



and 
(5) 



ku)d{&) +d+l 



Thus, selecting respectively in (4) and (5) the pairs {d, k) — (0, n — 1) and {d, k) = 
(n — 1, n — 1), we obtain (1). 

Jarnik [8, 9] has established that Khintchine's Transference Principle is best possible. 
Since it is a formal corollary of all the inequalities occurring in (2) and (3), it follows that 
any of these inequalities is optimal. Let us call spectrum of the function ujd, the set of 
values taken by the exponents (jJd{0), when ranges over R", with l,di, . . . ,dn linearly 
independent over Q. Then, Theorem 2 implies the following 
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Corollary. For any integer d with < d < n — 1, we have the lower bound 

d+1 



(6) ujdiO) > 



n — d^ 



and equahty holds in (6) for almost all © with respect to Lebesgue measure on R"'. The 
spectrum of the function ujd is equal to the whole interval [{d + l)/{n — d), +oo]. 

Proof. We deduce from (4) and (5) the lower bounds 
0Jd[Q) > J and 0Jd{<d) > 



n — d {n — d — l)u)n-i (©) + n 

By Dirichlet's Box Principle, we know that lvq{Q) > 1/n and c<;„_i(©) > n. Then, each of 
the two preceding inequalities imply (6). Moreover, (5) and (4) also provide us with the 
converse estimates 

a>o(e) > , , and iOn-i{e)>{n-d)iOd{e)+n-d-l, 

auJdl'O) + a + 1 

so that uJo{Q) > 1/n and a;„_i(0) > n whenever Ud{0) > {d+l)/{n — d). Now, the easy 
part of the classical Khintchine-Groshev Theorem [7, 11] shows that the set of points O for 
which uo{Q) > 1/n, or equivalently c<;„_i(©) > n, has Lebesgue measure in R". Thus, 
equality holds in (6) for almost all 0. 

As for the spectrum of the functions Ud, notice that all lower bounds (4) turn out to 
be equalities 

ojd+kie) = ~ '^^'^''}^] ^ ^ , 0<d<n-2,l<k<n-d-l, 
n — d — k 

whenever a;„_i(0) has minimal value noJo{Q) + n — 1, with regard to Khintchine's 
Transference Principle (1). Now, Jarnik [9] has established that for any w with 1/n < 
w < +00, there exists a point e R"' such that 

u>o{Q) = w and u>n-i{Q) = nw + n — 1. 

For such a point 0, we then have 

nw + d 

ojd(Q) = r, 0<d<n-l. 

n — d 

It follows that, for any < d < n — 1, the spectrum of the function ujd coincides with the 
interval [(ci+ l)/(n- ci),+oo]. □ 
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Remark. Similarly, the inequalities (5) become equalities when 

^n-i(e) 

- (n-lK_,(e)+n- 
Using now the construction of [8], we obtain in this way a second family 

MQ)= ( ^t^T^ ^ 0<d<n-l, 
[n — a — l)w + n 

of n-tuples {liJo{Q), . . . ,a;n-i(0)) indexed by the real parameter w > n. We are thus lead 
to address the following 

Problem. Find the spectrum in (R U {+00})"^ of the n-tuples 

(uJo{Q),...,UJn-liQ)^, 

when ranges over R"^, with . . . ,$n Unearly independent over Q. 

Necessary assumptions are provided by the inequalities (4) and (5), together with the 
lower bounds (6). Are these conditions sufficient ? It holds true for n = 2, as follows from 
our work [14] whose content is described below. 

Let us now display a different kind of refinement of Khintchine's Transference 
Principle, that has recently been pointed out by Y. Bugeaud and M. Laurent in Theorem 
8 of [2]. Following the "hat" notations of [1], introduce first uniform analogues of the 
above-mentioned exponents a;o(0) and a;^_i(0). 

Definition 3. We denote respectively by a>o(0) ^nd a;„_i(0) the supremum of the real 
numbers uj such that for all sufficiently large real number X , there exists a non-zero integer 
{n-\-l)-tuple {xq, . . . , Xn), with supremum norm max \xi\ < X, which satisfies respectively 

0<i<n 

the inequation 

max \xo9i — Xi\ < X~^ or \xq + xiOi + •••-!- XnOn\ < X~^ . 

l<i<n 

We can now refine Theorem 1 in the following way. 
Theorem 3. The inequalities 

(cDn-i(e)-iK-i(e) < a; re^ < 

((n - 2K_i(e) + iK-i(e) + (n - iK-i(e) - °^ ^ - 

(1 - a;o(e))c^^-i(e) - n + 2 - cDo(e) 
n-1 
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hold for any point Q in R" with l,9i, . . . ,9n hnearly independent over Q. 
The above estimate is stronger than (1), since 

a>n_i(0) > n and a>o(0) > — 

by Dirichlet's Box Principle. 

Notice that Theorem 3 is optimal in dimension n = 2. In that case, the uniform 
exponents loo{Q) and 0)1(0) are linked by Jarmk's equation [10] 

(T) ^„(e) = i-^, 

SO that Theorem 3 reads equivalently 

c.i(e) + ^i(e) - ""'^^^ ^) • 

It is established in [14] that the subset of (R>oU{+oo})^ made up by all possible quadruples 

'cc;i(e),c^o(e), (2^1(6), ^o(e)' 



when ranges over with 1,^1,^2 linearly independent over Q, is essentially described 
by the relations (7-8), together with the obvious lower bound 0)1 (0) > 2. Thus, the transfer 
inequalities (8) cannot be sharpened for generic points 0. 

To end this introduction, let us ask for various extensions of the above results. First, 
Khintchine's Transference Principle has naturally been extended to any system of real 
linear forms [6, 22], and even more generally, to systems of linear inequalities in an adelic 
context [3]. We address the problem of splitting these further transfer relations through 
adequate intermediate exponents, on the model of Theorem 2. That may possibly be 
achieved by improving the transfer results of [21]. In an other direction, one should also 
ask for some refined version of Theorem 2, involving uniform exponents as in Theorem 3. 

Transference principles play an important role in proving Schmidt's Subspace Theorem 
(see e.g. [22]), which extends the classical Roth's theorem on the rational approximations 
to an algebraic number [17]. They also occur in questions of approximation to a real 
number ^ by algebraic numbers of bounded degree. With regard to that topic, it might 
be fruitful to investigate more specifically transfer inequalities between exponents of the 
form u>d{^, . . . , ^"^), as well as their uniform analogues cDd(^, • • • , ^"') introduced (over the 
notation Wd,i{Q) with = (^, . . . ,^")) in Definition 4 of [2]. See the articles [5, 13, 18, 
19] and the reports [2, 20] for relating those exponents with various results on algebraic 
approximation. 
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2. Algebraic formulation 

We reformulate in this section Definition 2 in term of wedge product. One relates the 
exponent uJdiQ) to integer solutions of a system of linear inequations, as in Definition 1. 
This approach will enable us to employ standard arguments from the geometry of numbers. 

First, we equip the real vector space R"'"'"-^ with the usual dot product and extend it 
naturally to the Grassmann algebra A(R'^"'"^), by requiring that for any orthonormal basis 
{ei}o<i<n of R""'""'^, the family of wedge products 

A . . . A Cij, ; < ii < . . . < ifc < n, < /c < n + 1, 

is an orthonormal basis of A(R"'''"-'^). For any X e A(R""''-'^), we denote by |X| its Euclidean 
norm. Now, let P and Q be points in P"^(R) with homogeneous coordinates x and y. The 
projective distance between P and Q is measured by the ratio 



d{P, Q) = 



|xAy| 



m\y\ 

which does not depend on the choices of x and y. Using Lagrange's identity 

|xAy|2 + (x-y)2 = |x|2|y|2, 

we see that d(P, Q) turns out to be the sine of the acute angle determined in R"^"*"^ by the 
two lines Rx and Ry. See [16, 20] for further properties of the projective distance. 

Now, let L be a d-dimensional linear subvariety in P"^(R). Write L — P(V^), where V 
is the {d + 1) -dimensional subspace of R'^^-^ spanned by the homogeneous coordinates of 
the points of L. Select a basis {xq, . . . , x^} of V and put 

X = xo A . . . A Xd. 

Using the canonical basis {ei}o<i<n of R"^"'"^, we may identify A'^+i(R"+i) with K^^^tD , 
The multivector X is then called a system of Pliicker coordinates of L (or of V). Note 
that X is determined up to multiplication by a non-zero real number. It is known that 
the correspondence L X establishes a bijection between the set of d-dimensional linear 
subvariety of P'^(R) and the set of non-zero decomposable multivectors in A'^"'"^(R'^"'"^), 
up to an homothety. 

Lemma 1. Let be a point in P"^(R) with homogeneous coordinates y and let L be a 
hnear subvariety of P'^(R) with Pliicker coordinates X. Then 



d(e,L) = 



|yAX| 
|y||x| 
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d(e,L) 



Proof. Write L = P{V) as above. If y is orthogonal to V, we have 

Otherwise, denote by y' the orthogonal projection on V of the vector y. The minimal angle 
(Ry, Rx), when x ranges along V \ {0}, is clearly reached for x = y'. Therefore 

|y Ay^l 

|y||y'l " 

We may assume without loss of generality that y ^ 1^. Let {ei}i<i<d+i be an orthonormal 
basis of V, which we complete by an orthogonal unitary vector eo to get an orthonormal 
basis {ei}o<i<d+i of Ry© V. Choose X = ei A. . . Ae^+i as a system of Plucker coordinates 
of V. Then |X| = 1. Write now y = aeo + y' for some a e R. We have 

|y /\ y'l = |oi||eo A y'l = |a||y'| and |y A X| = |a||eo A ei A . . . A e^+il = |a|, 

so that 

d(e,L) 



|y||y'l |y| lyll^l 



as required. [] 



Suppose now that L is rational over Q, or equivalently that V can be generated by 
points xq, . . . , belonging to Q""""^. Then X = xq A . . . A x^ has rational coordinates in 

/ n-\-l\ y I 1 \ 

RU+iJ. We define H{L) as the Weil height of the (^^^^j-tuple X. Let us indicate an useful 
interpretation of the height H{L) in term of determinant of a lattice. 

Lemma 2. The group VnZ'^'^^ is a lattice in V. For any Z-basis {xq, . . . , x^} ofVnZ'^'^^, 
put X = xo A . . . A Xrf. Then 

det(VnZ"+^) = |X| = H{L). 

The notation det (FnZ"+^) indicates here the volume, with respect to the induced Eu- 
clidean norm on V, of the unit parallelepiped constructed on any such Z-basis {xq, . . . , x^}. 

Proof. The Z-module VdZ'^'^^ has rank d+1, since V is rational over Q. The multivector X 
is clearly a system of Plucker coordinates of L, and X is primitive in the group A''+^ (Z"+^) 
since {xq, . . . , x^} is a Z-basis oiVC] Z^'^^ . Therefore |X| = H{L). Moreover 



det {V n Z"+^) = /det (x, . X,) ^^^^^ 

V 0<j<c 



IXI 



by Cauchy-Binet formula. We refer to Theorem 1 of [21] for more details and for an 
extension of Lemma 2 to number fields. [] 

Lemmas 1 and 2 enable us to handle more easily the quantities a;d(0) through the 
following alternative 
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Definition 4. Let y be homogeneous coordinates of the point 0. For any d with 
< d < n — 1, the exponent codiQ) is the supremum of the real numbers u> for which 
there exist infinitely many integer decomposable multivectors X e A'^+^(Z'^"'"^) such that 

|y AX| < |X|-'^. 

For any d-dimensional rational linear subvariety L, select a system of Pliicker coordi- 
nates X as in Lemma 2. Then 

H{L) = \X\ and d(e, L) = |y|-^|X|- ^ A X|. 

Now, the equivalence of Definitions 2 and 4 is clear. 

Remark. We should have obtained the same exponent 0)^(0), when dropping in Definition 
4 the assumption that X is decomposable in A'^+-'^(Z"+-'^). We shall not use this property, 
which can be deduced from Mahler's theory of compound convex bodies [15, 22]. To that 
purpose, observe that for fixed positive real numbers Y < X, the convex body defined as 
the set of X in A'^"'"-'^(R"'+^) satisfying the linear inequations 

|X| < y^X and |y A X| < 

compares with the {d + l)-th compound of the convex body defined in R"^+^ by 

|x| < X and |y A x| < Y. 

Further applications to transfer inequalities will be given elsewhere. 



3. Proof of the Going-up inequality. 

We are now able to prove inequality (2). Fix homogeneous coordinates y of and 
follow Definition 4. Let X be a decomposable multivector in A^'^^ {Z'^'^^) such that 

|y AX| < |X|-'^. 

We may suppose without loss of generality that X is primitive in the group A'^+^(Z"^"'"^). 
Then X is a system of Pliicker coordinates of some rational {d + l)-dimensional subspace 
V C R"'"''-'^. By Lemma 2, we may write X = xq A . . . A for some Z-basis {xq, . . . , x^} of 
the lattice V n Z"+^ Specifically, we know from Lemma 2 that 

det(FnZ"+i) = |X|. 
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Let W be the orthogonal complement to V in R"^"""-^. Therefore, W is a rational subspace 
of R"^+^ with dimension n — d. Denote by A the orthogonal projection on W of the lattice 
2n+i_ xj^gn^ A is a lattice in W with determinant 

, det(Z'"+i) 1 

^^^(^) = det(ynzn+i) = p^- 

On the other hand, the Euclidean ball in W 

{zeW; |z|<i?}, 

centered at the origin of W with radius i?, has volume Vn-dR^~'^, where — 7r'^/r(l+/c/2) 
denotes the volume of the unit Euclidean ball in R'^. Choosing now 

Minkowski's convex body theorem shows that there exists a non-zero element ^ G A with 
norm 

\^\<R- 

Lift up now ^ into an element x e Z'^^^ whose orthogonal projection on W is ^, and put 

X' = xAX = ^AX. 

Then X' is a decomposable multivector in A'^+^(Z"^+^). Making use of Hadamard's 
inequality, we first bound from above its norm 

|X'| = |e||x| < i?|X| < |x|("-'^-i)/("-'^). 
A second use of Hadamard's inequality enables us to bound the wedge product 

lyAX'l = |yAC AX| < |C||yAX| < R\X.\-'^ < ^-^^-{u^+i/{n-d)) _ 
Combining the two last inequalities, we obtain 

lyAX'l < \^yj\^-{l,n-d)i^+l)/{n-d-l) _ 

Taking now u arbitrarily close to a;d(©), we have established the lower bound (2). 
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